Back

Molecular Ecology Resources

Wiley

Preprints posted in the last 30 days, ranked by how well they match Molecular Ecology Resources's content profile, based on 161 papers previously published here. The average preprint has a 0.06% match score for this journal, so anything above that is already an above-average fit.

1
Portable, multilocus DNA barcoding across the diversity of meiofauna

Keene, D.; Arya, S.; Walker, B.; Laumer, C. E.

2026-05-22 zoology 10.64898/2026.05.20.726206 medRxiv
Top 0.1%
69.8%
Show abstract

Molecular data have revolutionised taxonomic and ecological research on the hyperdiverse communities of aquatic benthic microinvertebrates known as meiofauna. However, reference sequence databases remain highly incomplete, with variable barcode genes or fragments studied from taxon to taxon. Furthermore, there is a typical tradeoff between universality of primers and phylogenetic resolution, with rRNA markers being robustly recoverable but failing to resolve species-level divergences, and mitochondrial markers showing the reverse trend. Here, we introduce Oxford Nanopore rRNA and COI amplicon sequencing (OrCa-seq), a rapid, low-cost protocol for parallel long-range PCR amplification and multiplexed sequencing of four amplicons, spanning the nearly-complete rRNA cistron ([~]7-8 kb) and the widely studied Folmer region of COI (represented as overlapping 313 and 658 bp amplicons). This protocol, with its associated bioinformatic workflow, was designed for conducting biodiversity inventories of meiofauna and can be easily carried out in field research and educational contexts, with data available from 96-well plates of specimens within a day of lysis. To validate the method, we processed six plates of student-isolated freshwater and limno-terrestrial meiofauna, characterising the recovery of target genes and taxa with both automated and human-curated BLAST database comparisons. These data demonstrate the universal applicability of OrCa-seq across effectively all meiofauna, including the very smallest species. Nonetheless, recovery efficiency for each amplicon shows variation by taxon, with the full-length Folmer COI amplicon standing out as the most challenging. We present exemplar phylogenetic trees integrating reference sequences, demonstrating the utility of these data in confirming morphological determinations and in identifying anonymous specimens in a reverse taxonomy context. While developed in a specific educational context for use on meiofauna, the OrCa-seq approach should be readily scalable to larger research datasets, adaptable to many specimen types, and to any combination of taxon-or target-specific primers. As such, it represents a compelling multi-locus extension to the ever-growing repertoire of nanopore DNA barcoding protocols.

2
From fluke to fragment: a multifaceted method for molecular sex identification and mitochondrial haplotyping from environmental DNA samples

Rodriguez, L. K.; Schallhart, S.; Hobmeier, P.; Curran, T.; Perez-Jorge, S.; Prieto, R.; Oliveira, C.; Silva, M. A.; Thalinger, B.

2026-05-04 genomics 10.64898/2026.04.30.719183 medRxiv
Top 0.1%
58.4%
Show abstract

O_LIEnvironmental DNA (eDNA) analyses have become a powerful tool for non-invasive biodiversity monitoring, yet the applicability of population genetic approaches to environmental samples remains largely unexplored. Even when genetic traces originate from a single individual, low target DNA concentrations and amplification or sequencing artefacts can compromise downstream genetic inferences. Here, we present a novel approach for obtaining demographic insights and lineage-level mitogenomic information from aquatic eDNA samples collected near vertebrate individuals. C_LIO_LIPaired eDNA and tissue samples were collected during sperm whale (Physeter macrocephalus) encounters in the Azores. Samples were screened for the presence of vertebrate eDNA and analyzed with a novel molecular sex identification assay. Additionally, long-range PCR was used to amplify up to five mitochondrial DNA fragments ([~]3-4k bp) before subsequent sequencing on an Oxford Nanopore Technologies platform. A stringent three-tier filtering framework capable of identifying true mitogenomic variation across eDNA samples was developed for maximum recovery of genetic diversity at the haplogroup level. By benchmarking eDNA samples via their paired tissues, parameter values were optimized to maximize concordance and minimize spurious variant calls. C_LIO_LISexing was successful for 50% of eDNA samples, with 96% concordance to paired tissues, and marine vertebrate DNA concentration significantly predicted sexing success. Further, Medaka polishing produced high identity mitochondrial consensus sequences (>16 kb) from eDNA samples. Across filtering regimes in the framework, curated SNP panels comprising up to 453 high-confidence mitochondrial SNPs resolved 19 haplogroups, with 93% concordance between eDNA and tissue samples. An intermediate bioinformatics filtering strategy maximized biologically accurate haplogroup recovery while minimizing sequencing artefacts, providing the most reliable lineage-level inferences. C_LIO_LIThis integrative approach demonstrates that targeted nuclear assays combined with long-range mitochondrial sequencing can recover individual-level genetic information from aquatic eDNA. By defining analytical thresholds governing success, the framework advances non-invasive genetic monitoring of populations via eDNA and enables population-level monitoring and conservation of endangered and genetically-vulnerable species. C_LI

3
A new method based on genome alignments provides a highly resolutive target enrichment set for weevils (Coleoptera, Curculionoidea)

ZELVELDER, B.; BENOIT, L.; LOISEAU, A.; HARAN, J.; ALLIO, R.

2026-05-13 evolutionary biology 10.64898/2026.05.09.724036 medRxiv
Top 0.1%
49.4%
Show abstract

Target enrichment methods have provided unprecedented advances in phylogenomics. Targeting hundreds of conserved regions has proven to be a good tradeoff between cost and efficiency, while being useful for museomics and diversified non-model clades. Unfortunately, current methods used for identifying such regions involve high degrees of conservation within targeted elements, usually pushing researchers to rely on flanking data with little guarantee for homology. With a growing number of high quality genomes available throughout the Tree of Life emerges new opportunities to improve marker selection. In this study, we introduce GABBI, a new method for designing target capture probes by taking advantage of genome alignments, avoiding the selection of a single reference genome that can cause notable biases. We compare GABBI-derived markers to the most commonly used probe design method, PHYLUCE, at two taxonomic scales, the weevil superfamily Curculionoidea and the tribe Pachyrhynchini. At both taxonomic scales, results show that our new method allows identifying more variable loci that prove to be more phylogenetically resolutive than the PHYLUCE-derived ones. Doing so, we provide the first probe set specifically designed for weevils, targeting a wide set of 4,255 shared homologous regions, encouraging future research on systematics and macroevolution of one of the most diverse and economically important groups of insects. By providing GABBI as an automated and open-access pipeline, we hope to open new probe design opportunities to other taxonomic groups that face similar phylogenetic obstacles.

4
Individual natal assignment in highly migratory species: the genomic baseline and its application in loggerhead turtles

Luna-Ortiz, A.; Barbanti, A.; Pegueroles, C.; Abreu-Grobois, F. A.; Casale, P.; Freggi, D.; Giralt, S.; Labastida-Estrada, E.; Llera-Herrera, R.; Machkour-M'Rabet, S.; Marco, A.; Margaritoulis, D.; Turkozan, O.; Pascual Berniola, M.; Carreras, C.

2026-05-10 evolutionary biology 10.64898/2026.05.06.723276 medRxiv
Top 0.1%
28.5%
Show abstract

O_LIEffective conservation of highly migratory species requires understanding genetic structure across breeding populations and access high{square}resolution markers capable of assigning individuals from mixed aggregates (e.g. bycatch or new nesting sites) to their natal origins. Genomic approaches provide unprecedented resolution but add methodological challenges; thus, it is essential to first build a genomic baseline from known breeding areas and then evaluate strategies for assigning unknown individuals. C_LIO_LITo address this, we used 2b-RAD sequencing, a genomic reduction technique useful for degraded DNA, and loggerhead turtles as a case study. This species shows philopatric breeding, while juveniles and adults form mixed aggregations in foraging grounds. C_LIO_LIOur results highlight the importance of building baselines that include all potential source populations contributing to mixed aggregations. We detected hierarchical genetic differentiation and high resolution and successfully assigned the natal origin of 124 unknown individuals from four Mediterranean foraging grounds. These grounds showed distinct source contributions, and comparisons with previous studies suggest possible temporal shifts in stock composition. C_LIO_LIWe provide a comprehensive genomic baseline for individual assignment of Altanto-Mediterranean loggerhead turtles of unknown natal origin and a general framework for identifying population-specific threats in highly migratory species. C_LI

5
Molecular Star Gazing: Development and Validation of an Environmental DNA Assay for the Imperiled Sunflower Sea Star (Pycnopodia helianthoides)

Gold, Z.; Robinson, K. M.; Gehman, A.-L. M.; Shea, M. M.; Lemay, M. A.; Weinrich, J.; Kellogg, C. T. E.; Clemente-Carvalho, R. B. G.; Schiebelhut, L. M.; Boehm, A. B.; Kidd, A.; Kim, A.; Hodin, J.; Dawson, M.; McAllister, S. M.

2026-05-12 molecular biology 10.64898/2026.05.07.723600 medRxiv
Top 0.1%
23.1%
Show abstract

The sunflower sea star (Pycnopodia helianthoides) suffered a catastrophic population decline across its range from 2013 to 2017 due to the devastating Vibrio pectenicida FHCF-3 driven sea star wasting disease (SSWD) pandemic with minimal signs of population recovery. The functional extinction of this apex predator across substantial parts of its range has created a need to identify and track the remaining intact populations. Environmental DNA (eDNA) approaches provide a simple, cost-effective, and non-destructive method for monitoring occurrences, and in some cases abundances, of marine species, consistently outperforming visual occurrence monitoring efforts in sensitivity, speed, and cost. Here, we designed, developed, and validated a P. helianthoides-specific eDNA assay to identify refugia, using both quantitative and digital droplet PCR approaches. We first generated the most comprehensive sea star mitochondrial genome reference database to date (n=93 taxa, n= 15 novel). We then used unikseq and Geneious bioinformatics software to identify the unique nad5 gene region and design a highly specific hydrolysis probe-based PCR assay. We validated the performance of this assay through laboratory, mesocosm, and field testing, demonstrating a highly specific and sensitive assay. In a field application of the new assay across regions in British Columbia, Canada, we found a positive correlation between P. helianthoides eDNA concentrations and biomass density, especially when appropriately accounting for spatiotemporal integration scales (R2=0.67). The eDNA assay provides a rapid and scalable tool for monitoring the sunflower sea star which has been proposed for listing as threatened under the U.S. Endangered Species Act of 1973. Molecular tools like the one presented here enhance management and recovery efforts not only by identification and monitoring of remnant wild populations, but also by helping to assess population level response and recovery following reintroduction efforts.

6
kinference: Pairwise kinship detection for Close-Kin Mark-Recapture

Bravington, M. V.; Baylis, S. M.; Eveson, P.; Feutry, P.

2026-05-21 genetics 10.64898/2026.05.18.725841 medRxiv
Top 0.1%
14.3%
Show abstract

AO_SCPLOWBSTRACTC_SCPLOWClose-Kin Mark-Recapture (CKMR) is a statistical framework for estimating demographic parameters of wild populations. Instead of recapturing individuals, it relies on the identification of closely-related pairs such as parents and offspring, or siblings. By measuring how often such close-kin are "recaptured" among sampled animals (whether alive or dead), scientists can estimate demographic parameters such as census size, mortality rates, and connectivity. CKMR is starting to change fisheries and wildlife management by giving more reliable demographic information, even for many species that resist conventional approaches. Here we introduce the kinference R package, which provides a set of tools for finding close-kin pairs among thousands of samples each genotyped at thousands of SNPs, and for associated quality control. The CKMR context implies different requirements and assumptions to many other kinship programs. In particular, kinference accounts empirically for linkage without requiring a genome assembly, is able to estimate and control false-negative and false-positive probabilities, and can cope with null alleles. The package has been developed and used in numerous CKMR projects since 2017. This paper documents the assumptions, statistical algorithms, and intended workflow for kinference.

7
Shark sexing from forensic, archival, and developmental samples using sex-linked DNA markers

Akane, O.; Kawaguchi, Y. W.; Niwa, T.; Uno, Y.; Kuraku, S.

2026-05-06 ecology 10.64898/2026.05.02.722412 medRxiv
Top 0.1%
14.2%
Show abstract

The effective management of threatened shark populations relies on accurate demographic data, particularly operational sex ratios. While sex identification in intact shark bodies is straightforward through the presence of external male organs, namely claspers, it remains impossible for processed fins in the illegal wildlife trade, early-stage embryos in breeding programs, or archived tissue fragments and blood samples where morphological traits are lost. Here, we present a robust molecular sexing framework leveraging recently identified sequences from shark sex chromosomes, consistently organized in the XY system, to our current knowledge. Our approach consists of two distinct methodologies tailored to the the current identification status of sex chromosome sequences in the target species. For the whale shark Rhincodon typus and the brownbanded bamboo shark Chiloscyllium punctatum, we employed end-point PCR assays targeting male-specific Y-linked markers. For the cloudy catshark Scyliorhinus torazame, we developed a quantitative PCR (qPCR) assay targeting differential X chromosome dosage. In this dosage-based system, females (XX) are distinguished by an amplification profile approximately one cycle earlier than males (XY). By integrating X-linked dosage quantification, our framework provides a critical internal control that significantly enhances reliability, allowing researchers to distinguish true females from PCR failures. This toolkit offers a versatile solution for diverse applications, ranging from the study of sex determination mechanisms in pre-phenotypic embryos to the reconstruction of sex ratios from space-constrained tissue archives and global wildlife forensics, thereby contributing to the comprehensive conservation of shark biodiversity.

8
Chromosome-level genome assembly and annotation of the threatened marbled teal (Marmaronetta angustirostris)

Ortego, J.; Lopez-Luque, R.; Backstrom, N.; Green, A. J.

2026-05-14 genomics 10.64898/2026.05.12.723956 medRxiv
Top 0.1%
14.0%
Show abstract

The marbled teal (Marmaronetta angustirostris) is a widely distributed but declining waterfowl species, classified as Near Threatened globally and Critically Endangered in Spain. Despite ongoing conservation actions, including ex situ management and population reinforcement programmes, the genomic consequences of long-term captivity, inbreeding, and patterns of functional genetic variation remain unknown due to the absence of a species-specific reference genome. Here, we present the first chromosome-level genome assembly for this species. The genome was generated using PacBio HiFi long reads and Omni-C data, yielding a 1.15Gb assembly with a scaffold N50 of 76.95Mb. A total of 97.16% of the assembly was anchored into 36 chromosome-scale scaffolds, including the Z and W sex chromosomes. BUSCO analysis recovered 99.2% of conserved avian genes. Gene prediction was performed using both ab initio and homology-based strategies, resulting in 16,048 protein-coding genes. This resource provides a foundation for genomewide analyses of inbreeding, demographic history, and adaptive variation, and will support evidencebased in situ and ex situ conservation strategies for this threatened species.

9
Biodiversity effects of beaver activity in a semi-natural enclosure revealed by eDNA

Hanfling, B.; Griffiths, N. P.; Macarthur, J. A.; Morrisey, B.; Svobodova, D.; Pritchard, V. L.; Tree, A.; Gaywood, M. J.

2026-05-16 ecology 10.64898/2026.05.15.725411 medRxiv
Top 0.1%
12.3%
Show abstract

O_LIEnvironmental DNA (eDNA) metabarcoding is an emerging tool for biodiversity assessment in freshwater systems, offering high-resolution insights into community composition. Here, we apply eDNA metabarcoding to evaluate the ecological impacts of Eurasian beaver (Castor fiber) activity within a seminatural enclosure in the Scottish Highlands. C_LIO_LIWe collected seasonal water samples from nine sites, six influenced by beaver dams and three control sites with no evidence of beaver engineering, across a 40-hectare enclosure. Samples were analysed for vertebrate and macroinvertebrate diversity using established 12S and COI markers. C_LIO_LIVertebrate alpha diversity did not differ significantly between beaver and control sites, likely reflecting the small spatial scale and low species richness of upland Scottish streams. However, community composition differed significantly between treatments, especially for fish (PERMANOVA, R2 = 0.55, P < 0.001), with beaver-influenced sites dominated by three-spined stickleback and control sites by brown trout. Macroinvertebrate communities showed a 78% increase in gamma diversity in beaver-modified habitats relative to controls. Species composition varied strongly with beaver presence (PERMANOVA, R2 = 0.29, P < 0.001), likely due to the creation of lentic-lotic mosaics and associated microhabitat diversity. Seasonal variation was significant in both taxonomic groups, with the lowest species richness and highest community dispersion observed in summer, probably reflecting hydrological and temperature-driven dynamics in eDNA production and transport. C_LIO_LIOur findings reinforce previous evidence that beaver dam-building activity enhances beta diversity in headwater systems. Additionally, we demonstrate that eDNA metabarcoding is a sensitive method for detecting spatial patterns in freshwater biodiversity associated with these activities at scales ranging from tens to hundreds of meters. These approaches could inform future monitoring strategies aligned with landscape-scale beaver management and reintroductions. C_LI

10
New chromosome-level haplotyped genome assemblies and annotation for the Japanese Quail (Coturnix Japonica)

Cabau, C.; Degalez, F.; Leroux, S.; Gourichon, D.; Serre, R.-F.; Vernette, C.; Donnadieu, C.; Iampietro, C.; Vandecasteele, C.; Pitel, F.; Klopp, C.

2026-05-14 genomics 10.64898/2026.05.12.724545 medRxiv
Top 0.1%
12.1%
Show abstract

The Japanese quail (Coturnix japonica) is a widely used model organism in developmental biology, genetics, and agriculture. Here, we present new, haplotyped, high-quality genome assemblies of the Japanese quail, generated using a combination of state-of-the-art sequencing technologies, including PacBio HiFi long reads, Oxford Nanopore sequencing, and Hi-C scaffolding. This assembly has a total length of 1.19 Gb, 80% of which is included in chromosomes, and is highly complete (BUSCO score aves_odb10: 97.3). Assembly metrics show a marked improvement in contiguity, with a significantly higher scaffold N50 and a lower number of contigs compared to the reference genome assembly. Remarkably, the assembly extends previously truncated chromosome ends, with 31 telomeres detected. In addition, we merged the existing Ensembl and Refseq annotations and obtained a combined set of 26,102 genes, of which 25,038 genes were successfully mapped on the improved assembly haplotype 1 (Cjap1.hap1). Together, these new genome assemblies and their enriched annotation provide a robust genomic framework for future research. They enhance our ability to investigate developmental processes, genetic and epigenetic inheritance, and host-pathogen interactions. Furthermore, they offer valuable insights for conservation genetics and sustainable breeding programs. This resource represents a critical step forward in leveraging the full potential of the Japanese quail as a model species in both basic and applied research.

11
Towards the reliable use of aerial eDNA for ecosystem monitoring

Sokal, N.; Urbez-Torres, J. R.; Da Ros, L.

2026-05-21 genomics 10.64898/2026.05.19.726284 medRxiv
Top 0.1%
10.3%
Show abstract

Evidence supporting the use of airborne eDNA for biodiversity studies and ecosystem monitoring is growing. The promise of wide-area population dynamics data for downstream applications in targeted monitoring of pests and pathogens for agriculture and rare species for conservation is appealing; however, several technical challenges persist. Here, we focused on the development of a comprehensive dataset to facilitate assay development and accelerate the use of aerial sampling for species detection. Year-round metabarcoding data was generated using bacterial, fungal, plant, and arthropod primer sets and resulted in relative abundance estimates for 4,960 amplicon sequence variants (ASVs), 1,748 ASVs of which were assigned to a minimum taxonomic level of genus (bacteria, fungi, plants) or class (arthropods). Sequence diversity assessments and seasonal clustering based on presence/absence detection patterns were performed for individual ASVs, while discerning quantitative changes in seasonal abundance required grouping ASVs to at least the genus level. Examination of the technical aspects of metabarcoding suggested that the use of subsampling allows for consistent detection of genera with relative abundance values above 2 %, even when samples have varying sequencing depths. Sequencing depth was the primary determinate for detecting sporadic and/or rare ASVs. Sampler comparisons, common sources of variation, and the benefits of barcoding regional species to supplement the existing taxonomic databases were discussed. Insufficient knowledge of sampler coverage area for the different organism types was identified as a limitation to the deployment of aerial monitoring networks. Considerations for further aerial metabarcoding efforts are suggested based on our experimental findings. ImportanceOur study deals directly with the generation, analysis and limitations of airborne eDNA metabarcoding data for re-use by the broader environmental research community. This includes timing of seasonal detection for possible genera of interest across multiple kingdoms, including bacteria, fungi, plants and animals (specifically arthropods), and support for the generation of local databases to assess the current limitations of universal primers for species/genus taxonomic resolution. With regards to methodology, it continues to build upon established best practices for airborne eDNA collection in areas such as sub-sampling and sampling replicates, sampler type and sequencing depth. To accelerate possible uptake and application of the data, we provide the identified ASVs and their seasonal relative abundances as a resource.

12
Detecting terrestrial insects from naturally exuding tree sap using environmental DNA: a pilot study

Kawakami, H.; Yuasa, H.; Kuroda, H.; Ichinose, T.

2026-05-18 ecology 10.64898/2026.05.16.724188 medRxiv
Top 0.1%
9.8%
Show abstract

Terrestrial environmental DNA (eDNA) approaches are rapidly expanding, yet robust, field-ready substrates for detecting insect DNA remain limited in forest ecosystems. Tree sap is a localized microhabitat that attracts diverse insects and may provide a useful substrate for surface eDNA sampling, but its potential for insect monitoring has rarely been evaluated. Here, we present a pilot proof-of-concept study testing naturally exuding tree sap and sap-mimicking traps as terrestrial eDNA substrates. We collected swab samples from sap and trap surfaces at two forest sites in Japan (Fujisawa and Minamisanriku) and performed metabarcoding using COI and an arthropod-focused 16S marker (gInsect). Reads were processed into amplicon sequence variants and assigned by BLAST top hits against NCBI nt, with high-confidence detections defined at identity [&ge;]98%. Across sites, sap and trap swabs yielded multiple high-confidence insect detections spanning several orders, including sap-associated stag beetles (Dorcus spp.). Overlap with contemporaneous conventional monitoring was limited, suggesting that sap-surface eDNA and conventional surveys capture partly different components of sap-associated insect assemblages. In a targeted 2024 spot survey, actively fermenting sap yielded multiple insect eDNA detections, whereas inactive, non-fermented sap yielded no high-confidence insect detections. Although limited by small sample size and the absence of dedicated process controls, these findings support the feasibility of tree sap as a localized terrestrial eDNA substrate and provide a basis for future replicated studies of sap-associated insect monitoring.

13
Substitution rate variation, not hidden paralogy, drives false hybridization signal in phylogenetic network inference

Li, B.; Ane, C.

2026-05-18 evolutionary biology 10.64898/2026.05.11.723986 medRxiv
Top 0.2%
6.5%
Show abstract

Phylogenetic network inference methods are increasingly used to detect hybridization and gene flow from genomic data, but their robustness to common sources of model violation remains poorly characterized. We conducted a simulation study to evaluate the effects of hidden paralogy and substitution rate variation on two widely used network inference methods: find_graphs from ADMIXTOOLS 2 and SNaQ. Using an eight-taxon species tree calibrated from an empirical reptile phylogeny, we simulated data under various levels of hidden paralogy (from none to strong) and three levels of rate variation (none, gene-specific, and lineage-specific). We found that hidden paralogy had limited impact on network inference under the conditions examined: both network methods correctly favored a tree without reticulation, and ASTRAL recovered the correct species tree every time. In contrast, lineage-specific rates severely biased find_graphs, inflating worst f-statistic residuals well beyond the standard acceptance threshold. SNaQ correctly selected a tree model almost always across all conditions, though its network with h = 1 reticulation displayed the true species tree with a lower probability under lineage-specific rates. We also show that the standard worst residuals threshold of 3 for find_graphs produces inflated type I error even without rate variation, and we recommend empirical calibration of this threshold within each study system.

14
Assessing the Efficacy of Computational Workshops and Participatory Live Coding in Evolutionary Biology

Swiston, S. K.; Kuehne, L.; Moore, R.; Landis, M. J.

2026-05-06 evolutionary biology 10.64898/2026.05.04.722624 medRxiv
Top 0.2%
6.5%
Show abstract

Computational workshops are common in evolutionary biology and are used to share discipline-specific tools and skills with researchers. Despite the perceived importance of these workshops, there is no common set of criteria for workshop success, and there are few peer-reviewed studies investigating the efficacy of workshops or assessing the value of particular instructional techniques in this context. Here, we focused on one key element of a successful workshop: its ability to increase participants motivation to use the methods and tools presented during the workshop. We analyzed the goals, perceptions, and future plans of research practitioners engaging in a workshop on phylogenetic methods of historical biogeography using pre- and post-workshop surveys. Overall, the workshop was successful at motivating participants, and survey responses provided insights into participants perceptions of different activities, including "participatory live coding". Apart from this case study, we aim to highlight the importance of developing a common set of workshop goals in collaboration with other workshop stakeholders and the need for specialized, validated tools for assessing the efficacy of computational workshops for researchers.

15
A songbird karyotype: cytogenetic confirmation of a migration-associated region rich in olfactory receptor genes.

Caballero Lopez, V.; Dedukh, D.; Ekman, D.; Kauzal, O.; Lundberg, M.; Odenthal-Hesse, L.; Proux-Wera, E.; Reifova, R.; Reif, J.; Altmanova, M.; Trifonov, V.; Bensch, S.

2026-05-07 genomics 10.64898/2026.05.04.721007 medRxiv
Top 0.2%
4.8%
Show abstract

The field of genetics of bird migration advances, driven by exponential refinements of sequencing and tracking technologies. In willow warblers (Phylloscopus trochilus), a complex repeat-rich region named MARB (Migration Associated Repeat Block) has recently been found to correlate with the routes taken by individual birds from Europe to their African wintering grounds. However, the genomic location of this region remains unknown. Here, we characterized MARB using a combination of approaches to understand how it evolved. We describe the region using long-read genome assemblies of two willow warbler subspecies (P. t. trochilus and P. t. acredula), two related species, the common chiffchaff (P. collybita) and the greenish warbler (P. trochiloides), and whole genome sequencing data from 76 willow warblers. Finally, we applied karyotyping and fluorescent in situ hybridization techniques on willow warbler spermatocytes to cytogenetically locate MARB. Due to the many repeats, we cannot order scaffolds in silico, but probe hybridization on the karyotype shows that MARB constitutes a single locus (~27.5 Mb) spanning most of the 11th largest chromosome in the willow warbler genome. Interestingly, the MARB regions of all species share several characteristics such as relatively high GC content (50%), a high density of specific repeat families and notably, more than 800 olfactory receptor sequences. Regions homologous to MARB may exist in several migrant bird genomes, though currently unassembled due to their complexity. Resolving these in species with similar migratory polymorphisms to willow warblers will be essential to determine whether MARB influences migratory behaviour across species.

16
Tracing the intruders: a global appraisal of marine invasive species detection through DNA-based approaches

Duarte, S.; Costa, F.

2026-05-07 ecology 10.64898/2026.05.05.722998 medRxiv
Top 0.2%
4.7%
Show abstract

Early detection and monitoring of non-indigenous species (NIS) is crucial to prevent their establishment and to reduce ecological and economic impacts in coastal ecosystems. Traditional monitoring approaches, which rely largely on morphological identification of collected organisms, are often time-consuming and may fail to detect species that occur at low abundance, are morphologically cryptic, or are present in the form of inconspicuous life stages. DNA-based approaches, particularly those resorting to environmental DNA, have demonstrated high aptitude for biodiversity monitoring and biosecurity surveillance. By examining the genetic material from bulk community samples or released into the environment, DNA-based approaches enable the detection of species without the need for direct observation, thereby increasing detection sensitivity and expanding the scope of monitoring programs. Despite the rapid growth of its employment in marine monitoring, a global synthesis of the status and trends of DNA-based approaches for detecting NIS in this environment has been lacking. Here, we present such synthesis, based on 146 published studies employing DNA for NIS detections in coastal environments. Two main methodological approaches were used across the reviewed studies, namely DNA metabarcoding which was applied in 49% of studies, closely followed by targeted single-species PCR assays, used in 42% of the studies. A smaller proportion of studies (10%) combined both approaches, integrating broad community screening with targeted detection to improve surveillance efficiency. Globally, 752 NIS were detected across disparate taxonomic groups, with metazoans representing the largest proportion of detections (464 species), followed by Chromista (210 species) and Plantae (77 species). Among these, the most frequently detected taxonomic groups included Dinophyceae (Dinoflagellata), Teleostei (Chordata), Florideophyceae (Rodophyta), Polychaeta (Annelida), Copepoda and Malacostraca (Arthropoda), and Ascidiacea (Chordata). At the species level, several well-known marine invaders were recurrently reported, including Bugula neritina (Linnaeus, 1758), Styela plicata (Lesueur, 1823), Acartia (Acanthacartia) tonsa Dana, 1849-1852, and Botryllus schlosseri (Pallas, 1766), highlighting the ability of DNA approaches to detect widespread and established invaders across different regions. The mitochondrial cytochrome c oxidase subunit I (COI) gene was the most widely used genetic marker, reflecting its broad taxonomic coverage and extensive representation in reference databases, particularly for targeting Metazoa. Ribosomal RNA genes, particularly 18S and 16S rRNA gene markers, were also frequently employed to target a wider range of eukaryotic taxa. Regarding sampled substrates, water was by far the most analyzed substrate, followed by zooplankton and biofouling communities collected from man-made structures. Notably, approximately 31% of all NIS detections reported in the reviewed studies constituted new regional records. These results highlight the potential of eDNA for coastal monitoring but also underline important limitations. Persistent geographical, taxonomic, and methodological biases can affect detection outcomes, and reliance on single sample types or markers may increase false negatives - particularly critical for NIS early detection. Therefore, multi-marker and multi-substrate approaches are essential to improve detection reliability and support effective biosecurity strategies. As reference databases continue to expand and methodological protocols become increasingly standardized, DNA-based monitoring is likely to play a central role in future management and surveillance of biological invasions in coastal ecosystems. Graphical Abstract O_FIG O_LINKSMALLFIG WIDTH=200 HEIGHT=133 SRC="FIGDIR/small/722998v1_ufig1.gif" ALT="Figure 1"> View larger version (75K): org.highwire.dtl.DTLVardef@17948b1org.highwire.dtl.DTLVardef@193832dorg.highwire.dtl.DTLVardef@189033dorg.highwire.dtl.DTLVardef@33cddf_HPS_FORMAT_FIGEXP M_FIG C_FIG

17
Efficient and Robust Genomic DNA Isolation and Next-Generation Sequencing Library Preparation from Recalcitrant Wild Grape Species

Bhattarai, A.; Smith, J.; Abdelgaffar, H.; Carpenter, R.; Mishra, S.; Fuentes, J. L. J.; Shirsekar, G.

2026-05-21 genomics 10.64898/2026.05.19.713680 medRxiv
Top 0.3%
4.3%
Show abstract

This protocol details the extraction of high-molecular-weight genomic DNA from grapevine tissues (wild and cultivated Vitis spp., including pathogen-infected samples) and the subsequent preparation of Illumina(R) whole-genome sequencing libraries using bead-bound Tn5 transposase. It is designed to overcome challenges from polyphenolic compounds and secondary metabolites in wild plants, providing a cost-effective workflow for large-scale population genomics. It includes recipes for buffers, incubation times, critical notes, and troubleshooting tips to maximize yield and library quality. Although designed for the grapevine DNA, this protocol is potentially applicable to other similar wild plant species HighlightsO_LIOptimized CTAB-PTB DNA extraction protocol for field-collected wild plant tissues. C_LIO_LIEffective removal of polyphenols and secondary metabolites associated with DNA using PTB. C_LIO_LICost-effective Illumina DNA Prep library preparation using bead-bound Tn5 transposase (Tagmentation). C_LIO_LIScalable workflow suitable for large-scale population genomics in Vitis species. C_LIO_LIValidated method for high-molecular-weight DNA and high-quality sequencing data. C_LI Graphical Abstract O_FIG O_LINKSMALLFIG WIDTH=200 HEIGHT=195 SRC="FIGDIR/small/713680v1_ufig1.gif" ALT="Figure 1"> View larger version (31K): org.highwire.dtl.DTLVardef@b637d4org.highwire.dtl.DTLVardef@10b563aorg.highwire.dtl.DTLVardef@14a32caorg.highwire.dtl.DTLVardef@4c9577_HPS_FORMAT_FIGEXP M_FIG C_FIG

18
Promises and limitations of local ancestry inference in imputed ancient genomes

Bougiouri, K.; Irving-Pease, E. K.; Frantz, L. A. F.; Racimo, F.; Petr, M.

2026-05-20 evolutionary biology 10.64898/2026.05.19.725905 medRxiv
Top 0.3%
4.3%
Show abstract

Recent advances in genome imputation have enabled the application of state-of-the-art statistical methods--originally developed for present-day genomes--to ancient genomes. One class of such methods, known as local ancestry inference (LAI), can model an individuals genome as a mosaic of tracts assigned to different putative ancestral sources, revealing patterns of genetic ancestry across the genome. However, most LAI methods have been designed to study recent admixture events in human history, and they generally assume large panels of present-day genomes. Despite the recent availability of high-quality imputed ancient genomes, it remains unknown to what degree LAI inference is reliable for such datasets. Ancient DNA is often characterized by heterogeneous geographic and temporal sampling, varying degrees of divergence between ancient source proxies and admixing populations, and complex demographic histories. Here, we performed an extensive set of population genetic simulations to evaluate the accuracy of four popular LAI methods-RFMix, FLARE, MOSAIC and simpLAI-under different demographic scenarios, various temporal sampling schemes, sample sizes, and admixture dates. We quantify the accuracy of these methods as a function of different parameters in practically relevant scenarios, and provide general guidelines for future studies utilizing LAI in ancient DNA research.

19
A Rarefaction Approach to Identify Local Introgression in a Three Population Tree

Smith, T. Q.; Szpiech, Z. A.

2026-05-16 evolutionary biology 10.64898/2026.05.13.724952 medRxiv
Top 0.3%
4.0%
Show abstract

Pattersons D statistic, also known as the ABBA-BABA statistic, is widely used to detect the presence of archaic genome-wide introgression between two non-sister taxa. Requiring only a single lineage from each of four taxa where one taxon acts as an outgroup to determine the ancestral allele, Pattersons D, counts the imbalance between the number of biallelic sites where either the second and third taxa (ABAB site) or the first and third taxa (BABA site). When there is no introgression, these counts are expected to be equal, and a discordance between counts suggests introgression from the third taxon into either the first or second. Pattersons D is limited to the detection of genome-wide introgression and exhibits a high false-positive rate when applied to smaller genomic segments. Here, we present a new method, D STatistic with Allelic Rarefaction (D*), to address these limitations. D* uses multiple lineages and does not require an outgroup to calculate the imbalance between the number of alleles found exclusively in the second and third taxa and the number of alleles found exclusively in the first and third taxa. D* employs a rarefaction technique to correct for unequal sample-size and allows multiallelic sites. We use simulations to show that D* has better precision and recall for detecting introgressed segments of DNA when compared to similar methods under a wide variety of model parameters and in the presence of technical artifacts common to ancient DNA analyses. We conclude with an analysis of Denisovan DNA introgression in modern day Papuans. Precompiled executables, the manual, and source code can be found at https://github.com/TQ-Smith/DSTAR

20
Evaluating non-lethal tissue suitability for telomere length measurement in the Japanese eel

Moriguchi, Y.; Kimura, S. S.; Kume, M.; Takagi, J.; Uno, Y.; Kanoh, J.; Mitamura, H.

2026-05-13 molecular biology 10.64898/2026.05.09.723945 medRxiv
Top 0.3%
3.6%
Show abstract

Telomere length (TL) is increasingly used in ecology as a biomarker of individual quality and environmental stress, yet research on non-model species with complex life histories remains limited. Because TL varies among tissues and across ages in a species-specific manner, identifying non-lethal tissues that reliably reflect whole-organism telomere dynamics is essential for longitudinal telomere studies in the field. This study aimed to evaluate tissue-specific TL in Japanese eel (Anguilla japonica), an endangered catadromous fish. We first mapped the chromosomal distribution of telomeric sequences using fluorescent in situ hybridization (FISH), the first application of this method in this species. We then tested whether muscle and caudal fin, which can be sampled easily and non-lethally, can serve as suitable proxy tissues for TL measurements in wild individuals. Relative telomere length (RTL) was quantified by qPCR in blood, brain, caudal fin, gonads, heart, liver, and muscle. FISH analysis confirmed telomeric repeats at all chromosomal ends, with only weak interstitial signals on three chromosomal pairs unlikely to affect qPCR-based estimates. A generalized additive mixed model and Wilcoxons signed-rank tests revealed significant inter-tissue differences: RTL was shortest in the brain and muscle and longest in liver, blood and caudal fin. Muscle and caudal fin RTL were significantly correlated with RTL in many other tissues, supporting their use as proxy tissues for longitudinal TL monitoring, including responses to environmental variation. Both total length and age were tested as explanatory variables for RTL, and the model including total length showed a better fit than the age-based model. Non-linear relationships between RTL and total length observed in several tissues suggest physiological shifts associated with growth and sexual differentiation. Overall, these findings advance understanding of telomere dynamics in eels and establish muscle and caudal fin as suitable tissues for repeated, non-lethal TL assessment in ecological and conservation contexts.